HE ZHU He Zhu

Short Biography: He Zhu received the B.S. degree in Computer Science from Nankai University, China, in 2020, and the M.S. degree in Information Science from Hokkaido University, Japan, in 2024. He is currently pursuing the Ph.D. degree with the Graduate School of Information Science and Technology at Hokkaido University. His research interests include Visual Question Answering, Medical Image Analysis, and Persionlized Federated Learning. He works on state-of-the-art AI-based techniques for medical images. He is a student member of IEEE.

E-mail: zhu[at]lmd.ist.hokudai.ac.jp

Google Scholar IEEE Explore LinkedIn

Biography Publication Fellowship and Awards

 

Biography

  • 2024/04 ~ Present Hokkaido University, Ph.D. in Information Science
  • 2022/04 ~ 2024/03 Hokkaido University, M.S. in Information Science
  • 2021/10 ~ 2022/03 Hokkaido University, Research Student
  • 2016/09 ~ 2020/06 Nankai University, B.S. in Computer Science

 

Publication

Journal

  1. He Zhu, Ren Togo, Takahiro Ogawa, Miki Haseyama, “Multimodal natural language explanation generation for visual question answering based on multiple reference information,” Electronics, vol. 12, no. 10, 2183, 2023. [Paper] (2022 IF: 2.9)
  2. He Zhu, Ren Togo, Takahiro Ogawa, Miki Haseyama, “Diversity learning based on multi-latent space for medical image visual question generation,” Sensors, vol. 23, no. 3, 1057, 2023. [Paper] (2022 IF: 3.9)

International Conference

  1. He Zhu, Ren Togo, Takahiro Ogawa, Miki Haseyama, “Prompt-based personalized federated learning for medical visual question answering,” IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 1821-1825, 2024. [Paper]
  2. He Zhu, Ren Togo, Takahiro Ogawa, Miki Haseyama, “Interpretable visual question answering referring to outside knowledge,” IEEE International Conference on Image Processing (ICIP), pp. 2140-2144, 2023. [Paper]
  3. He Zhu, Ren Togo, Takahiro Ogawa, Miki Haseyama, “A medical domain visual question generation model via large language model,” IEEE International Conference on Consumer Electronics-Taiwan (ICCE-TW), pp. 163-164, 2023. [Paper]
  4. He Zhu, Ren Togo, Takahiro Ogawa, Miki Haseyama, “A multimodal interpretable visual question answering model introducing image caption processor,” IEEE Global Conference on Consumer Electronics (GCCE), pp. 805-806, 2022. [Paper]

Domestic Conference

  1. 朱 赫, 藤後 廉, 小川 貴弘, 長谷山 美紀, “物体検出モデルに基づく視覚表現を用いた解釈可能なVisual Question Answeringモデルに関する検討,” 映像情報メディア学会技術報告, vol. 47, no. 6, pp. 17-21, 2023.

 

Fellowship and Awards

  1. 似鳥国際奨学財団 北海道みらいIT人材奨学金 奨学生 (2023/04 ~ 2024/03) [Link]
  2. 映像情報メディア学会 優秀研究発表賞 (2023/12)